Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 8704 |
| Missing cells (%) | 3.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 539.6 KiB |
| Average record size in memory | 55.3 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 15 |
age has a high cardinality: 100 distinct values | High cardinality |
hhwt is highly correlated with perwt | High correlation |
perwt is highly correlated with hhwt | High correlation |
sample is highly correlated with country | High correlation |
empstat is highly correlated with empstatd | High correlation |
indig is highly correlated with race | High correlation |
country is highly correlated with sample | High correlation |
race is highly correlated with indig | High correlation |
edattain is highly correlated with edattaind | High correlation |
empstatd is highly correlated with empstat | High correlation |
edattaind is highly correlated with edattain | High correlation |
internet has 1801 (18.0%) missing values | Missing |
race has 4756 (47.6%) missing values | Missing |
indig has 2147 (21.5%) missing values | Missing |
df_index has unique values | Unique |
Reproduction
| Analysis started | 2021-01-11 17:02:54.221857 |
|---|---|
| Analysis finished | 2021-01-11 17:03:08.978034 |
| Duration | 14.76 seconds |
| Software version | pandas-profiling v2.10.0 |
| Download configuration | config.yaml |
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26064256.58 |
|---|---|
| Minimum | 14710 |
| Maximum | 52546277 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 14710 |
|---|---|
| 5-th percentile | 2609690.45 |
| Q1 | 12934202.25 |
| median | 26004179 |
| Q3 | 38974435.25 |
| 95-th percentile | 49747660.85 |
| Maximum | 52546277 |
| Range | 52531567 |
| Interquartile range (IQR) | 26040233 |
Descriptive statistics
| Standard deviation | 15124513.48 |
|---|---|
| Coefficient of variation (CV) | 0.5802779538 |
| Kurtosis | -1.191586877 |
| Mean | 26064256.58 |
| Median Absolute Deviation (MAD) | 13014604 |
| Skewness | 0.006525841157 |
| Sum | 2.606425658 × 1011 |
| Variance | 2.287509079 × 1014 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 49891327 | 1 | < 0.1% |
| 46624015 | 1 | < 0.1% |
| 25242903 | 1 | < 0.1% |
| 42401065 | 1 | < 0.1% |
| 49380629 | 1 | < 0.1% |
| 42775828 | 1 | < 0.1% |
| 46899191 | 1 | < 0.1% |
| 33755657 | 1 | < 0.1% |
| 45644707 | 1 | < 0.1% |
| 34504074 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| 14710 | 1 | |
| 27684 | 1 | |
| 29925 | 1 | |
| 34204 | 1 | |
| 35160 | 1 |
| Value | Count | Frequency (%) |
| 52546277 | 1 | |
| 52545139 | 1 | |
| 52535356 | 1 | |
| 52534316 | 1 | |
| 52530219 | 1 |
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.6 KiB |
| brazil | |
|---|---|
| mexico | |
| colombia | |
| argentina | |
| peru | |
| Other values (11) |
Length
| Max length | 18 |
|---|---|
| Median length | 6 |
| Mean length | 6.7396 |
| Min length | 4 |
Characters and Unicode
| Total characters | 67396 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | brazil |
|---|---|
| 2nd row | mexico |
| 3rd row | brazil |
| 4th row | mexico |
| 5th row | argentina |
| Value | Count | Frequency (%) |
| brazil | 3946 | |
| mexico | 2144 | |
| colombia | 762 | 7.6% |
| argentina | 761 | 7.6% |
| peru | 519 | 5.2% |
| venezuela | 417 | 4.2% |
| chile | 307 | 3.1% |
| ecuador | 297 | 3.0% |
| dominican republic | 172 | 1.7% |
| haiti | 145 | 1.5% |
| Other values (6) | 530 | 5.3% |
| Value | Count | Frequency (%) |
| brazil | 3946 | |
| mexico | 2144 | |
| colombia | 762 | 7.4% |
| argentina | 761 | 7.3% |
| peru | 519 | 5.0% |
| venezuela | 417 | 4.0% |
| chile | 307 | 3.0% |
| ecuador | 297 | 2.9% |
| republic | 172 | 1.7% |
| dominican | 172 | 1.7% |
| Other values (9) | 865 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 8904 | |
| a | 8297 | |
| r | 6162 | |
| l | 5818 | |
| e | 5558 | |
| b | 4880 | |
| o | 4460 | 6.6% |
| z | 4363 | 6.5% |
| c | 4115 | 6.1% |
| m | 3141 | 4.7% |
| Other values (12) | 11698 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 67034 | |
| Space Separator | 362 | 0.5% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 8904 | |
| a | 8297 | |
| r | 6162 | |
| l | 5818 | |
| e | 5558 | |
| b | 4880 | |
| o | 4460 | 6.7% |
| z | 4363 | 6.5% |
| c | 4115 | 6.1% |
| m | 3141 | 4.7% |
| Other values (11) | 11336 |
| Value | Count | Frequency (%) |
| 362 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67034 | |
| Common | 362 | 0.5% |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 8904 | |
| a | 8297 | |
| r | 6162 | |
| l | 5818 | |
| e | 5558 | |
| b | 4880 | |
| o | 4460 | 6.7% |
| z | 4363 | 6.5% |
| c | 4115 | 6.1% |
| m | 3141 | 4.7% |
| Other values (11) | 11336 |
| Value | Count | Frequency (%) |
| 362 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67396 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 8904 | |
| a | 8297 | |
| r | 6162 | |
| l | 5818 | |
| e | 5558 | |
| b | 4880 | |
| o | 4460 | 6.6% |
| z | 4363 | 6.5% |
| c | 4115 | 6.1% |
| m | 3141 | 4.7% |
| Other values (12) | 11698 |
year
Real number (ℝ≥0)
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2009.6268 |
|---|---|
| Minimum | 2001 |
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 2001 |
|---|---|
| 5-th percentile | 2001 |
| Q1 | 2010 |
| median | 2010 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 14 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.882219587 |
|---|---|
| Coefficient of variation (CV) | 0.001931811213 |
| Kurtosis | -0.1044309751 |
| Mean | 2009.6268 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.503504689 |
| Sum | 20096268 |
| Variance | 15.07162892 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 2010 | 5239 | |
| 2015 | 2144 | |
| 2005 | 857 | 8.6% |
| 2007 | 626 | 6.3% |
| 2001 | 550 | 5.5% |
| 2002 | 307 | 3.1% |
| 2003 | 145 | 1.5% |
| 2011 | 132 | 1.3% |
| Value | Count | Frequency (%) |
| 2001 | 550 | |
| 2002 | 307 | 3.1% |
| 2003 | 145 | 1.5% |
| 2005 | 857 | |
| 2007 | 626 |
| Value | Count | Frequency (%) |
| 2015 | 2144 | |
| 2011 | 132 | 1.3% |
| 2010 | 5239 | |
| 2007 | 626 | 6.3% |
| 2005 | 857 | 8.6% |
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.6 KiB |
| brazil 2010 | |
|---|---|
| mexico 2015 | |
| colombia 2005 | |
| argentina 2010 | |
| peru 2007 | |
| Other values (11) |
Length
| Max length | 23 |
|---|---|
| Median length | 11 |
| Mean length | 11.7396 |
| Min length | 9 |
Characters and Unicode
| Total characters | 117396 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | brazil 2010 |
|---|---|
| 2nd row | mexico 2015 |
| 3rd row | brazil 2010 |
| 4th row | mexico 2015 |
| 5th row | argentina 2010 |
| Value | Count | Frequency (%) |
| brazil 2010 | 3946 | |
| mexico 2015 | 2144 | |
| colombia 2005 | 762 | 7.6% |
| argentina 2010 | 761 | 7.6% |
| peru 2007 | 519 | 5.2% |
| venezuela 2001 | 417 | 4.2% |
| chile 2002 | 307 | 3.1% |
| ecuador 2010 | 297 | 3.0% |
| dominican republic 2010 | 172 | 1.7% |
| haiti 2003 | 145 | 1.5% |
| Other values (6) | 530 | 5.3% |
| Value | Count | Frequency (%) |
| 2010 | 5239 | |
| brazil | 3946 | |
| 2015 | 2144 | |
| mexico | 2144 | |
| 2005 | 857 | 4.2% |
| colombia | 762 | 3.7% |
| argentina | 761 | 3.7% |
| 2007 | 626 | 3.1% |
| 2001 | 550 | 2.7% |
| peru | 519 | 2.5% |
| Other values (17) | 2814 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 17724 | |
| 10362 | 8.8% | |
| 2 | 10307 | 8.8% |
| i | 8904 | 7.6% |
| a | 8297 | 7.1% |
| 1 | 8197 | 7.0% |
| r | 6162 | 5.2% |
| l | 5818 | 5.0% |
| e | 5558 | 4.7% |
| b | 4880 | 4.2% |
| Other values (18) | 31187 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 67034 | |
| Decimal Number | 40000 | |
| Space Separator | 10362 | 8.8% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 8904 | |
| a | 8297 | |
| r | 6162 | |
| l | 5818 | |
| e | 5558 | |
| b | 4880 | |
| o | 4460 | 6.7% |
| z | 4363 | 6.5% |
| c | 4115 | 6.1% |
| m | 3141 | 4.7% |
| Other values (11) | 11336 |
| Value | Count | Frequency (%) |
| 0 | 17724 | |
| 2 | 10307 | |
| 1 | 8197 | |
| 5 | 3001 | 7.5% |
| 7 | 626 | 1.6% |
| 3 | 145 | 0.4% |
| Value | Count | Frequency (%) |
| 10362 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 67034 | |
| Common | 50362 |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 8904 | |
| a | 8297 | |
| r | 6162 | |
| l | 5818 | |
| e | 5558 | |
| b | 4880 | |
| o | 4460 | 6.7% |
| z | 4363 | 6.5% |
| c | 4115 | 6.1% |
| m | 3141 | 4.7% |
| Other values (11) | 11336 |
| Value | Count | Frequency (%) |
| 0 | 17724 | |
| 10362 | ||
| 2 | 10307 | |
| 1 | 8197 | |
| 5 | 3001 | 6.0% |
| 7 | 626 | 1.2% |
| 3 | 145 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117396 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 0 | 17724 | |
| 10362 | 8.8% | |
| 2 | 10307 | 8.8% |
| i | 8904 | 7.6% |
| a | 8297 | 7.1% |
| 1 | 8197 | 7.0% |
| r | 6162 | 5.2% |
| l | 5818 | 5.0% |
| e | 5558 | 4.7% |
| b | 4880 | 4.2% |
| Other values (18) | 31187 |
serial
Real number (ℝ≥0)
| Distinct | 9992 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1608447907 |
|---|---|
| Minimum | 112001 |
| Maximum | 6189373000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 112001 |
|---|---|
| 5-th percentile | 44133550.05 |
| Q1 | 300191000.2 |
| median | 893770000 |
| Q3 | 2499659000 |
| 95-th percentile | 5341378550 |
| Maximum | 6189373000 |
| Range | 6189260999 |
| Interquartile range (IQR) | 2199468000 |
Descriptive statistics
| Standard deviation | 1679117758 |
|---|---|
| Coefficient of variation (CV) | 1.043936674 |
| Kurtosis | 0.2236980339 |
| Mean | 1608447907 |
| Median Absolute Deviation (MAD) | 763431999.5 |
| Skewness | 1.151049553 |
| Sum | 1.608447907 × 1013 |
| Variance | 2.819436445 × 1018 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1038852000 | 2 | < 0.1% |
| 179516001 | 2 | < 0.1% |
| 1714242000 | 2 | < 0.1% |
| 483438001 | 2 | < 0.1% |
| 2188438000 | 2 | < 0.1% |
| 68159000 | 2 | < 0.1% |
| 594681000 | 2 | < 0.1% |
| 8729000 | 2 | < 0.1% |
| 26910000 | 1 | < 0.1% |
| 181732001 | 1 | < 0.1% |
| Other values (9982) | 9982 |
| Value | Count | Frequency (%) |
| 112001 | 1 | |
| 319001 | 1 | |
| 465000 | 1 | |
| 465001 | 1 | |
| 632000 | 1 |
| Value | Count | Frequency (%) |
| 6189373000 | 1 | |
| 6189295000 | 1 | |
| 6187806000 | 1 | |
| 6187616000 | 1 | |
| 6186946000 | 1 |
persons
Real number (ℝ≥0)
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.6389 |
|---|---|
| Minimum | 1 |
| Maximum | 28 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 28 |
| Range | 27 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.289897629 |
|---|---|
| Coefficient of variation (CV) | 0.4936294444 |
| Kurtosis | 7.761745766 |
| Mean | 4.6389 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.727309938 |
| Sum | 46389 |
| Variance | 5.243631153 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 2303 | |
| 3 | 1772 | |
| 5 | 1757 | |
| 6 | 1165 | |
| 2 | 1054 | |
| 7 | 688 | 6.9% |
| 1 | 350 | 3.5% |
| 8 | 345 | 3.5% |
| 9 | 234 | 2.3% |
| 10 | 143 | 1.4% |
| Other values (15) | 189 | 1.9% |
| Value | Count | Frequency (%) |
| 1 | 350 | 3.5% |
| 2 | 1054 | |
| 3 | 1772 | |
| 4 | 2303 | |
| 5 | 1757 |
| Value | Count | Frequency (%) |
| 28 | 2 | |
| 27 | 1 | |
| 24 | 2 | |
| 22 | 1 | |
| 21 | 1 |
| Distinct | 1792 |
|---|---|
| Distinct (%) | 17.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.731379 |
|---|---|
| Minimum | 0 |
| Maximum | 198 |
| Zeros | 9 |
| Zeros (%) | 0.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4.74 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 22.123 |
| Maximum | 198 |
| Range | 198 |
| Interquartile range (IQR) | 5.26 |
Descriptive statistics
| Standard deviation | 8.743821307 |
|---|---|
| Coefficient of variation (CV) | 0.8985182169 |
| Kurtosis | 61.21473892 |
| Mean | 9.731379 |
| Median Absolute Deviation (MAD) | 2.46 |
| Skewness | 5.820674417 |
| Sum | 97313.79 |
| Variance | 76.45441105 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 3258 | |
| 2 | 535 | 5.3% |
| 4 | 522 | 5.2% |
| 6 | 315 | 3.1% |
| 4.64 | 283 | 2.8% |
| 8 | 193 | 1.9% |
| 12 | 71 | 0.7% |
| 16 | 54 | 0.5% |
| 14 | 42 | 0.4% |
| 22 | 33 | 0.3% |
| Other values (1782) | 4694 |
| Value | Count | Frequency (%) |
| 0 | 9 | |
| 1 | 10 | |
| 1.03 | 1 | < 0.1% |
| 1.04 | 1 | < 0.1% |
| 1.07 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 198 | 1 | |
| 148 | 1 | |
| 127.39 | 1 | |
| 120.12 | 1 | |
| 118 | 1 |
gq
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
| households | |
|---|---|
| other group quarters | 26 |
| institutions | 12 |
| 1-person unit created by splitting large household | 9 |
| group quarters (collective), n.s | 5 |
Length
| Max length | 50 |
|---|---|
| Median length | 10 |
| Mean length | 10.0754 |
| Min length | 10 |
Characters and Unicode
| Total characters | 100754 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | households |
|---|---|
| 2nd row | households |
| 3rd row | households |
| 4th row | households |
| 5th row | households |
| Value | Count | Frequency (%) |
| households | 9948 | |
| other group quarters | 26 | 0.3% |
| institutions | 12 | 0.1% |
| 1-person unit created by splitting large household | 9 | 0.1% |
| group quarters (collective), n.s | 5 | 0.1% |
| Value | Count | Frequency (%) |
| households | 9948 | |
| quarters | 31 | 0.3% |
| group | 31 | 0.3% |
| other | 26 | 0.3% |
| institutions | 12 | 0.1% |
| household | 9 | 0.1% |
| unit | 9 | 0.1% |
| 1-person | 9 | 0.1% |
| splitting | 9 | 0.1% |
| large | 9 | 0.1% |
| Other values (4) | 28 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 19997 | |
| s | 19983 | |
| h | 19940 | |
| e | 10060 | |
| u | 10040 | |
| l | 9985 | |
| d | 9966 | |
| r | 146 | 0.1% |
| t | 134 | 0.1% |
| 121 | 0.1% | |
| Other values (16) | 382 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 100595 | |
| Space Separator | 121 | 0.1% |
| Other Punctuation | 10 | < 0.1% |
| Decimal Number | 9 | < 0.1% |
| Dash Punctuation | 9 | < 0.1% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 19997 | |
| s | 19983 | |
| h | 19940 | |
| e | 10060 | |
| u | 10040 | |
| l | 9985 | |
| d | 9966 | |
| r | 146 | 0.1% |
| t | 134 | 0.1% |
| i | 68 | 0.1% |
| Other values (9) | 276 | 0.3% |
| Value | Count | Frequency (%) |
| , | 5 | |
| . | 5 |
| Value | Count | Frequency (%) |
| 121 |
| Value | Count | Frequency (%) |
| 1 | 9 |
| Value | Count | Frequency (%) |
| - | 9 |
| Value | Count | Frequency (%) |
| ( | 5 |
| Value | Count | Frequency (%) |
| ) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 100595 | |
| Common | 159 | 0.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 19997 | |
| s | 19983 | |
| h | 19940 | |
| e | 10060 | |
| u | 10040 | |
| l | 9985 | |
| d | 9966 | |
| r | 146 | 0.1% |
| t | 134 | 0.1% |
| i | 68 | 0.1% |
| Other values (9) | 276 | 0.3% |
| Value | Count | Frequency (%) |
| 121 | ||
| 1 | 9 | 5.7% |
| - | 9 | 5.7% |
| ( | 5 | 3.1% |
| ) | 5 | 3.1% |
| , | 5 | 3.1% |
| . | 5 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 100754 |
Most frequent character per block
| Value | Count | Frequency (%) |
| o | 19997 | |
| s | 19983 | |
| h | 19940 | |
| e | 10060 | |
| u | 10040 | |
| l | 9985 | |
| d | 9966 | |
| r | 146 | 0.1% |
| t | 134 | 0.1% |
| 121 | 0.1% | |
| Other values (16) | 382 | 0.4% |
geolev1
Real number (ℝ≥0)
| Distinct | 291 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 257790.0231 |
|---|---|
| Minimum | 32002 |
| Maximum | 862023 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 32002 |
|---|---|
| 5-th percentile | 32034 |
| Q1 | 76031 |
| median | 152132 |
| Q3 | 484014 |
| 95-th percentile | 604021 |
| Maximum | 862023 |
| Range | 830021 |
| Interquartile range (IQR) | 407983 |
Descriptive statistics
| Standard deviation | 232148.4976 |
|---|---|
| Coefficient of variation (CV) | 0.9005332897 |
| Kurtosis | -0.1431791563 |
| Mean | 257790.0231 |
| Median Absolute Deviation (MAD) | 76109 |
| Skewness | 0.9641215669 |
| Sum | 2577900231 |
| Variance | 5.389292492 × 1010 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 76035 | 720 | 7.2% |
| 76031 | 474 | 4.7% |
| 32006 | 312 | 3.1% |
| 76029 | 282 | 2.8% |
| 76043 | 273 | 2.7% |
| 76041 | 233 | 2.3% |
| 76052 | 203 | 2.0% |
| 484020 | 200 | 2.0% |
| 76033 | 195 | 1.9% |
| 218009 | 192 | 1.9% |
| Other values (281) | 6916 |
| Value | Count | Frequency (%) |
| 32002 | 60 | 0.6% |
| 32006 | 312 | |
| 32010 | 5 | 0.1% |
| 32014 | 54 | 0.5% |
| 32018 | 20 | 0.2% |
| Value | Count | Frequency (%) |
| 862023 | 46 | |
| 862022 | 7 | 0.1% |
| 862021 | 12 | 0.1% |
| 862020 | 26 | |
| 862019 | 12 | 0.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1801 |
| Missing (%) | 18.0% |
| Memory size | 10.1 KiB |
| no | |
|---|---|
| niu (not in universe) | |
| yes | |
| unknown | 41 |
Length
| Max length | 21 |
|---|---|
| Median length | 3 |
| Mean length | 8.612269789 |
| Min length | 2 |
Characters and Unicode
| Total characters | 70612 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | niu (not in universe) |
|---|---|
| 2nd row | yes |
| 3rd row | niu (not in universe) |
| 4th row | no |
| 5th row | yes |
| Value | Count | Frequency (%) |
| no | 3865 | |
| niu (not in universe) | 2762 | |
| yes | 1531 | 15.3% |
| unknown | 41 | 0.4% |
| (Missing) | 1801 |
| Value | Count | Frequency (%) |
| no | 3865 | |
| not | 2762 | |
| in | 2762 | |
| niu | 2762 | |
| universe | 2762 | |
| yes | 1531 | 9.3% |
| unknown | 41 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 15036 | |
| i | 8286 | |
| 8286 | ||
| e | 7055 | |
| o | 6668 | |
| u | 5565 | 7.9% |
| s | 4293 | 6.1% |
| ( | 2762 | 3.9% |
| t | 2762 | 3.9% |
| v | 2762 | 3.9% |
| Other values (5) | 7137 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 56802 | |
| Space Separator | 8286 | 11.7% |
| Open Punctuation | 2762 | 3.9% |
| Close Punctuation | 2762 | 3.9% |
Most frequent character per category
| Value | Count | Frequency (%) |
| n | 15036 | |
| i | 8286 | |
| e | 7055 | |
| o | 6668 | |
| u | 5565 | 9.8% |
| s | 4293 | 7.6% |
| t | 2762 | 4.9% |
| v | 2762 | 4.9% |
| r | 2762 | 4.9% |
| y | 1531 | 2.7% |
| Other values (2) | 82 | 0.1% |
| Value | Count | Frequency (%) |
| 8286 |
| Value | Count | Frequency (%) |
| ( | 2762 |
| Value | Count | Frequency (%) |
| ) | 2762 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56802 | |
| Common | 13810 | 19.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| n | 15036 | |
| i | 8286 | |
| e | 7055 | |
| o | 6668 | |
| u | 5565 | 9.8% |
| s | 4293 | 7.6% |
| t | 2762 | 4.9% |
| v | 2762 | 4.9% |
| r | 2762 | 4.9% |
| y | 1531 | 2.7% |
| Other values (2) | 82 | 0.1% |
| Value | Count | Frequency (%) |
| 8286 | ||
| ( | 2762 | 20.0% |
| ) | 2762 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70612 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 15036 | |
| i | 8286 | |
| 8286 | ||
| e | 7055 | |
| o | 6668 | |
| u | 5565 | 7.9% |
| s | 4293 | 6.1% |
| ( | 2762 | 3.9% |
| t | 2762 | 3.9% |
| v | 2762 | 3.9% |
| Other values (5) | 7137 |
computer
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
| no | |
|---|---|
| yes | |
| niu (not in universe) | 41 |
| unknown/missing | 34 |
Length
| Max length | 21 |
|---|---|
| Median length | 2 |
| Mean length | 2.3765 |
| Min length | 2 |
Characters and Unicode
| Total characters | 23765 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no |
|---|---|
| 2nd row | no |
| 3rd row | no |
| 4th row | yes |
| 5th row | yes |
| Value | Count | Frequency (%) |
| no | 7381 | |
| yes | 2544 | 25.4% |
| niu (not in universe) | 41 | 0.4% |
| unknown/missing | 34 | 0.3% |
| Value | Count | Frequency (%) |
| no | 7381 | |
| yes | 2544 | 25.1% |
| not | 41 | 0.4% |
| in | 41 | 0.4% |
| niu | 41 | 0.4% |
| universe | 41 | 0.4% |
| unknown/missing | 34 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 7681 | |
| o | 7456 | |
| s | 2653 | 11.2% |
| e | 2626 | 11.0% |
| y | 2544 | 10.7% |
| i | 191 | 0.8% |
| 123 | 0.5% | |
| u | 116 | 0.5% |
| ( | 41 | 0.2% |
| t | 41 | 0.2% |
| Other values (8) | 293 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23526 | |
| Space Separator | 123 | 0.5% |
| Open Punctuation | 41 | 0.2% |
| Close Punctuation | 41 | 0.2% |
| Other Punctuation | 34 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| n | 7681 | |
| o | 7456 | |
| s | 2653 | 11.3% |
| e | 2626 | 11.2% |
| y | 2544 | 10.8% |
| i | 191 | 0.8% |
| u | 116 | 0.5% |
| t | 41 | 0.2% |
| v | 41 | 0.2% |
| r | 41 | 0.2% |
| Other values (4) | 136 | 0.6% |
| Value | Count | Frequency (%) |
| 123 |
| Value | Count | Frequency (%) |
| ( | 41 |
| Value | Count | Frequency (%) |
| ) | 41 |
| Value | Count | Frequency (%) |
| / | 34 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23526 | |
| Common | 239 | 1.0% |
Most frequent character per script
| Value | Count | Frequency (%) |
| n | 7681 | |
| o | 7456 | |
| s | 2653 | 11.3% |
| e | 2626 | 11.2% |
| y | 2544 | 10.8% |
| i | 191 | 0.8% |
| u | 116 | 0.5% |
| t | 41 | 0.2% |
| v | 41 | 0.2% |
| r | 41 | 0.2% |
| Other values (4) | 136 | 0.6% |
| Value | Count | Frequency (%) |
| 123 | ||
| ( | 41 | 17.2% |
| ) | 41 | 17.2% |
| / | 34 | 14.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23765 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 7681 | |
| o | 7456 | |
| s | 2653 | 11.2% |
| e | 2626 | 11.0% |
| y | 2544 | 10.7% |
| i | 191 | 0.8% |
| 123 | 0.5% | |
| u | 116 | 0.5% |
| ( | 41 | 0.2% |
| t | 41 | 0.2% |
| Other values (8) | 293 | 1.2% |
pernum
Real number (ℝ≥0)
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8293 |
|---|---|
| Minimum | 1 |
| Maximum | 22 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 9.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.846917058 |
|---|---|
| Coefficient of variation (CV) | 0.6527823343 |
| Kurtosis | 4.22190227 |
| Mean | 2.8293 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.534156834 |
| Sum | 28293 |
| Variance | 3.41110262 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2771 | |
| 2 | 2442 | |
| 3 | 1885 | |
| 4 | 1325 | |
| 5 | 732 | 7.3% |
| 6 | 401 | 4.0% |
| 7 | 209 | 2.1% |
| 8 | 114 | 1.1% |
| 9 | 56 | 0.6% |
| 10 | 29 | 0.3% |
| Other values (7) | 36 | 0.4% |
| Value | Count | Frequency (%) |
| 1 | 2771 | |
| 2 | 2442 | |
| 3 | 1885 | |
| 4 | 1325 | |
| 5 | 732 | 7.3% |
| Value | Count | Frequency (%) |
| 22 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 3 | |
| 14 | 3 | |
| 13 | 4 |
| Distinct | 1790 |
|---|---|
| Distinct (%) | 17.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.741712 |
|---|---|
| Minimum | 1 |
| Maximum | 198 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4.79 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 22.123 |
| Maximum | 198 |
| Range | 197 |
| Interquartile range (IQR) | 5.21 |
Descriptive statistics
| Standard deviation | 8.738742653 |
|---|---|
| Coefficient of variation (CV) | 0.897043831 |
| Kurtosis | 61.33514744 |
| Mean | 9.741712 |
| Median Absolute Deviation (MAD) | 2.435 |
| Skewness | 5.828609816 |
| Sum | 97417.12 |
| Variance | 76.36562315 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 3267 | |
| 2 | 535 | 5.3% |
| 4 | 522 | 5.2% |
| 6 | 315 | 3.1% |
| 4.64 | 283 | 2.8% |
| 8 | 193 | 1.9% |
| 12 | 71 | 0.7% |
| 16 | 54 | 0.5% |
| 14 | 42 | 0.4% |
| 22 | 33 | 0.3% |
| Other values (1780) | 4685 |
| Value | Count | Frequency (%) |
| 1 | 10 | |
| 1.03 | 1 | < 0.1% |
| 1.04 | 1 | < 0.1% |
| 1.07 | 1 | < 0.1% |
| 1.08 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 198 | 1 | |
| 148 | 1 | |
| 127.39 | 1 | |
| 120.12 | 1 | |
| 118 | 1 |
| Distinct | 100 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.7 KiB |
| 13 | 211 |
|---|---|
| 5 | 208 |
| 12 | 208 |
| 15 | 202 |
| 20 | 201 |
| Other values (95) |
Length
| Max length | 20 |
|---|---|
| Median length | 2 |
| Mean length | 2.2885 |
| Min length | 1 |
Characters and Unicode
| Total characters | 22885 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 25 |
|---|---|
| 2nd row | 13 |
| 3rd row | 3 |
| 4th row | 42 |
| 5th row | 57 |
| Value | Count | Frequency (%) |
| 13 | 211 | 2.1% |
| 5 | 208 | 2.1% |
| 12 | 208 | 2.1% |
| 15 | 202 | 2.0% |
| 20 | 201 | 2.0% |
| 3 | 200 | 2.0% |
| 14 | 195 | 1.9% |
| 11 | 194 | 1.9% |
| 18 | 193 | 1.9% |
| 9 | 192 | 1.9% |
| Other values (90) | 7996 |
| Value | Count | Frequency (%) |
| 1 | 365 | 3.3% |
| year | 365 | 3.3% |
| 13 | 211 | 1.9% |
| 12 | 208 | 1.9% |
| 5 | 208 | 1.9% |
| 15 | 202 | 1.9% |
| 20 | 201 | 1.8% |
| 3 | 200 | 1.8% |
| 14 | 195 | 1.8% |
| 11 | 194 | 1.8% |
| Other values (94) | 8562 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3120 | |
| 2 | 2720 | |
| 3 | 2494 | |
| 4 | 2133 | |
| 5 | 1947 | |
| 6 | 1501 | 6.6% |
| 7 | 1277 | 5.6% |
| 8 | 1091 | 4.8% |
| 0 | 983 | 4.3% |
| 911 | 4.0% | |
| Other values (18) | 4708 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18152 | |
| Lowercase Letter | 3819 | 16.7% |
| Space Separator | 911 | 4.0% |
| Math Symbol | 2 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 720 | |
| a | 718 | |
| s | 547 | |
| r | 528 | |
| y | 526 | |
| t | 194 | 5.1% |
| n | 194 | 5.1% |
| l | 192 | 5.0% |
| h | 192 | 5.0% |
| o | 2 | 0.1% |
| Other values (5) | 6 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 3120 | |
| 2 | 2720 | |
| 3 | 2494 | |
| 4 | 2133 | |
| 5 | 1947 | |
| 6 | 1501 | |
| 7 | 1277 | |
| 8 | 1091 | 6.0% |
| 0 | 983 | 5.4% |
| 9 | 886 | 4.9% |
| Value | Count | Frequency (%) |
| 911 |
| Value | Count | Frequency (%) |
| / | 1 |
| Value | Count | Frequency (%) |
| + | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19066 | |
| Latin | 3819 | 16.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 720 | |
| a | 718 | |
| s | 547 | |
| r | 528 | |
| y | 526 | |
| t | 194 | 5.1% |
| n | 194 | 5.1% |
| l | 192 | 5.0% |
| h | 192 | 5.0% |
| o | 2 | 0.1% |
| Other values (5) | 6 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 3120 | |
| 2 | 2720 | |
| 3 | 2494 | |
| 4 | 2133 | |
| 5 | 1947 | |
| 6 | 1501 | |
| 7 | 1277 | |
| 8 | 1091 | 5.7% |
| 0 | 983 | 5.2% |
| 911 | 4.8% | |
| Other values (3) | 889 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22885 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 1 | 3120 | |
| 2 | 2720 | |
| 3 | 2494 | |
| 4 | 2133 | |
| 5 | 1947 | |
| 6 | 1501 | 6.6% |
| 7 | 1277 | 5.6% |
| 8 | 1091 | 4.8% |
| 0 | 983 | 4.3% |
| 911 | 4.0% | |
| Other values (18) | 4708 |
sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.0 KiB |
| male | |
|---|---|
| female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.996 |
| Min length | 4 |
Characters and Unicode
| Total characters | 49960 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | female |
| 4th row | female |
| 5th row | male |
| Value | Count | Frequency (%) |
| male | 5020 | |
| female | 4980 |
| Value | Count | Frequency (%) |
| male | 5020 | |
| female | 4980 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14980 | |
| m | 10000 | |
| a | 10000 | |
| l | 10000 | |
| f | 4980 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49960 |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 14980 | |
| m | 10000 | |
| a | 10000 | |
| l | 10000 | |
| f | 4980 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49960 |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 14980 | |
| m | 10000 | |
| a | 10000 | |
| l | 10000 | |
| f | 4980 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49960 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 14980 | |
| m | 10000 | |
| a | 10000 | |
| l | 10000 | |
| f | 4980 | 10.0% |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 4756 |
| Missing (%) | 47.6% |
| Memory size | 10.6 KiB |
| white | |
|---|---|
| brown (brazil) | |
| black | |
| mestizo (indigenous and white) | |
| indigenous | 79 |
| Other values (7) | 114 |
Length
| Max length | 30 |
|---|---|
| Median length | 5 |
| Mean length | 9.686308162 |
| Min length | 5 |
Characters and Unicode
| Total characters | 50795 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | white |
|---|---|
| 2nd row | white |
| 3rd row | white |
| 4th row | white |
| 5th row | white |
| Value | Count | Frequency (%) |
| white | 2632 | |
| brown (brazil) | 1742 | 17.4% |
| black | 364 | 3.6% |
| mestizo (indigenous and white) | 313 | 3.1% |
| indigenous | 79 | 0.8% |
| asian | 39 | 0.4% |
| montubio (ecuador) | 29 | 0.3% |
| unknown | 23 | 0.2% |
| afro-ecuadorian | 11 | 0.1% |
| mulatto (black and white) | 6 | 0.1% |
| Other values (2) | 6 | 0.1% |
| (Missing) | 4756 |
| Value | Count | Frequency (%) |
| white | 2951 | |
| brown | 1742 | |
| brazil | 1742 | |
| indigenous | 392 | 4.9% |
| black | 370 | 4.6% |
| and | 319 | 4.0% |
| mestizo | 313 | 3.9% |
| asian | 39 | 0.5% |
| ecuador | 29 | 0.4% |
| montubio | 29 | 0.4% |
| Other values (8) | 52 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 5869 | |
| w | 4718 | 9.3% |
| b | 3883 | 7.6% |
| e | 3704 | 7.3% |
| r | 3545 | 7.0% |
| t | 3311 | 6.5% |
| n | 2993 | 5.9% |
| h | 2955 | 5.8% |
| 2734 | 5.4% | |
| o | 2595 | 5.1% |
| Other values (14) | 14488 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 43870 | |
| Space Separator | 2734 | 5.4% |
| Open Punctuation | 2090 | 4.1% |
| Close Punctuation | 2090 | 4.1% |
| Dash Punctuation | 11 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| i | 5869 | |
| w | 4718 | |
| b | 3883 | |
| e | 3704 | |
| r | 3545 | |
| t | 3311 | |
| n | 2993 | 6.8% |
| h | 2955 | 6.7% |
| o | 2595 | 5.9% |
| a | 2579 | 5.9% |
| Other values (10) | 7718 |
| Value | Count | Frequency (%) |
| 2734 |
| Value | Count | Frequency (%) |
| ( | 2090 |
| Value | Count | Frequency (%) |
| ) | 2090 |
| Value | Count | Frequency (%) |
| - | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43870 | |
| Common | 6925 | 13.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| i | 5869 | |
| w | 4718 | |
| b | 3883 | |
| e | 3704 | |
| r | 3545 | |
| t | 3311 | |
| n | 2993 | 6.8% |
| h | 2955 | 6.7% |
| o | 2595 | 5.9% |
| a | 2579 | 5.9% |
| Other values (10) | 7718 |
| Value | Count | Frequency (%) |
| 2734 | ||
| ( | 2090 | |
| ) | 2090 | |
| - | 11 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 50795 |
Most frequent character per block
| Value | Count | Frequency (%) |
| i | 5869 | |
| w | 4718 | 9.3% |
| b | 3883 | 7.6% |
| e | 3704 | 7.3% |
| r | 3545 | 7.0% |
| t | 3311 | 6.5% |
| n | 2993 | 5.9% |
| h | 2955 | 5.8% |
| 2734 | 5.4% | |
| o | 2595 | 5.1% |
| Other values (14) | 14488 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2147 |
| Missing (%) | 21.5% |
| Memory size | 10.0 KiB |
| no | |
|---|---|
| yes | |
| unknown | 43 |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.13752706 |
| Min length | 2 |
Characters and Unicode
| Total characters | 16786 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no |
|---|---|
| 2nd row | no |
| 3rd row | no |
| 4th row | yes |
| 5th row | no |
| Value | Count | Frequency (%) |
| no | 6945 | |
| yes | 865 | 8.6% |
| unknown | 43 | 0.4% |
| (Missing) | 2147 | 21.5% |
| Value | Count | Frequency (%) |
| no | 6945 | |
| yes | 865 | 11.0% |
| unknown | 43 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 7074 | |
| o | 6988 | |
| y | 865 | 5.2% |
| e | 865 | 5.2% |
| s | 865 | 5.2% |
| u | 43 | 0.3% |
| k | 43 | 0.3% |
| w | 43 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16786 |
Most frequent character per category
| Value | Count | Frequency (%) |
| n | 7074 | |
| o | 6988 | |
| y | 865 | 5.2% |
| e | 865 | 5.2% |
| s | 865 | 5.2% |
| u | 43 | 0.3% |
| k | 43 | 0.3% |
| w | 43 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16786 |
Most frequent character per script
| Value | Count | Frequency (%) |
| n | 7074 | |
| o | 6988 | |
| y | 865 | 5.2% |
| e | 865 | 5.2% |
| s | 865 | 5.2% |
| u | 43 | 0.3% |
| k | 43 | 0.3% |
| w | 43 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16786 |
Most frequent character per block
| Value | Count | Frequency (%) |
| n | 7074 | |
| o | 6988 | |
| y | 865 | 5.2% |
| e | 865 | 5.2% |
| s | 865 | 5.2% |
| u | 43 | 0.3% |
| k | 43 | 0.3% |
| w | 43 | 0.3% |
lit
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
| yes, literate | |
|---|---|
| no, illiterate | |
| niu (not in universe) | |
| unknown/missing | 46 |
Length
| Max length | 21 |
|---|---|
| Median length | 13 |
| Mean length | 13.7772 |
| Min length | 13 |
Characters and Unicode
| Total characters | 137772 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | yes, literate |
|---|---|
| 2nd row | yes, literate |
| 3rd row | niu (not in universe) |
| 4th row | yes, literate |
| 5th row | yes, literate |
| Value | Count | Frequency (%) |
| yes, literate | 7916 | |
| no, illiterate | 1232 | 12.3% |
| niu (not in universe) | 806 | 8.1% |
| unknown/missing | 46 | 0.5% |
| Value | Count | Frequency (%) |
| yes | 7916 | |
| literate | 7916 | |
| no | 1232 | 5.7% |
| illiterate | 1232 | 5.7% |
| niu | 806 | 3.7% |
| universe | 806 | 3.7% |
| not | 806 | 3.7% |
| in | 806 | 3.7% |
| unknown/missing | 46 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 27824 | |
| t | 19102 | |
| i | 12890 | |
| 11566 | ||
| l | 10380 | 7.5% |
| r | 9954 | 7.2% |
| , | 9148 | 6.6% |
| a | 9148 | 6.6% |
| s | 8814 | 6.4% |
| y | 7916 | 5.7% |
| Other values (11) | 11030 | 8.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 115400 | |
| Space Separator | 11566 | 8.4% |
| Other Punctuation | 9194 | 6.7% |
| Open Punctuation | 806 | 0.6% |
| Close Punctuation | 806 | 0.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 27824 | |
| t | 19102 | |
| i | 12890 | |
| l | 10380 | 9.0% |
| r | 9954 | 8.6% |
| a | 9148 | 7.9% |
| s | 8814 | 7.6% |
| y | 7916 | 6.9% |
| n | 4640 | 4.0% |
| o | 2084 | 1.8% |
| Other values (6) | 2648 | 2.3% |
| Value | Count | Frequency (%) |
| , | 9148 | |
| / | 46 | 0.5% |
| Value | Count | Frequency (%) |
| 11566 |
| Value | Count | Frequency (%) |
| ( | 806 |
| Value | Count | Frequency (%) |
| ) | 806 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 115400 | |
| Common | 22372 | 16.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 27824 | |
| t | 19102 | |
| i | 12890 | |
| l | 10380 | 9.0% |
| r | 9954 | 8.6% |
| a | 9148 | 7.9% |
| s | 8814 | 7.6% |
| y | 7916 | 6.9% |
| n | 4640 | 4.0% |
| o | 2084 | 1.8% |
| Other values (6) | 2648 | 2.3% |
| Value | Count | Frequency (%) |
| 11566 | ||
| , | 9148 | |
| ( | 806 | 3.6% |
| ) | 806 | 3.6% |
| / | 46 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 137772 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 27824 | |
| t | 19102 | |
| i | 12890 | |
| 11566 | ||
| l | 10380 | 7.5% |
| r | 9954 | 7.2% |
| , | 9148 | 6.6% |
| a | 9148 | 6.6% |
| s | 8814 | 6.4% |
| y | 7916 | 5.7% |
| Other values (11) | 11030 | 8.0% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
| less than primary completed | |
|---|---|
| primary completed | |
| secondary completed | |
| university completed | |
| niu (not in universe) | 374 |
Length
| Max length | 27 |
|---|---|
| Median length | 20 |
| Mean length | 21.9258 |
| Min length | 7 |
Characters and Unicode
| Total characters | 219258 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | secondary completed |
|---|---|
| 2nd row | primary completed |
| 3rd row | less than primary completed |
| 4th row | primary completed |
| 5th row | primary completed |
| Value | Count | Frequency (%) |
| less than primary completed | 4338 | |
| primary completed | 3061 | |
| secondary completed | 1701 | 17.0% |
| university completed | 480 | 4.8% |
| niu (not in universe) | 374 | 3.7% |
| unknown | 46 | 0.5% |
| Value | Count | Frequency (%) |
| completed | 9580 | |
| primary | 7399 | |
| less | 4338 | |
| than | 4338 | |
| secondary | 1701 | 5.8% |
| university | 480 | 1.6% |
| niu | 374 | 1.3% |
| universe | 374 | 1.3% |
| not | 374 | 1.3% |
| in | 374 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 26427 | |
| 19378 | 8.8% | |
| r | 17353 | 7.9% |
| m | 16979 | 7.7% |
| p | 16979 | 7.7% |
| t | 14772 | 6.7% |
| l | 13918 | 6.3% |
| a | 13438 | 6.1% |
| o | 11701 | 5.3% |
| c | 11281 | 5.1% |
| Other values (12) | 57032 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 199132 | |
| Space Separator | 19378 | 8.8% |
| Open Punctuation | 374 | 0.2% |
| Close Punctuation | 374 | 0.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 26427 | |
| r | 17353 | 8.7% |
| m | 16979 | 8.5% |
| p | 16979 | 8.5% |
| t | 14772 | 7.4% |
| l | 13918 | 7.0% |
| a | 13438 | 6.7% |
| o | 11701 | 5.9% |
| c | 11281 | 5.7% |
| d | 11281 | 5.7% |
| Other values (9) | 45003 |
| Value | Count | Frequency (%) |
| 19378 |
| Value | Count | Frequency (%) |
| ( | 374 |
| Value | Count | Frequency (%) |
| ) | 374 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 199132 | |
| Common | 20126 | 9.2% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 26427 | |
| r | 17353 | 8.7% |
| m | 16979 | 8.5% |
| p | 16979 | 8.5% |
| t | 14772 | 7.4% |
| l | 13918 | 7.0% |
| a | 13438 | 6.7% |
| o | 11701 | 5.9% |
| c | 11281 | 5.7% |
| d | 11281 | 5.7% |
| Other values (9) | 45003 |
| Value | Count | Frequency (%) |
| 19378 | ||
| ( | 374 | 1.9% |
| ) | 374 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 219258 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 26427 | |
| 19378 | 8.8% | |
| r | 17353 | 7.9% |
| m | 16979 | 7.7% |
| p | 16979 | 7.7% |
| t | 14772 | 6.7% |
| l | 13918 | 6.3% |
| a | 13438 | 6.1% |
| o | 11701 | 5.3% |
| c | 11281 | 5.1% |
| Other values (12) | 57032 |
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.6 KiB |
| some primary completed | |
|---|---|
| primary (6 yrs) completed | |
| no schooling | |
| secondary, general track completed | |
| lower secondary general completed | |
| Other values (9) |
Length
| Max length | 36 |
|---|---|
| Median length | 22 |
| Mean length | 23.8095 |
| Min length | 12 |
Characters and Unicode
| Total characters | 238095 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | secondary, general track completed |
|---|---|
| 2nd row | primary (6 yrs) completed |
| 3rd row | no schooling |
| 4th row | lower secondary general completed |
| 5th row | primary (6 yrs) completed |
| Value | Count | Frequency (%) |
| some primary completed | 2245 | |
| primary (6 yrs) completed | 1791 | |
| no schooling | 1615 | |
| secondary, general track completed | 1120 | |
| lower secondary general completed | 1093 | |
| university completed | 480 | 4.8% |
| primary (4 yrs) completed | 478 | 4.8% |
| niu (not in universe) | 374 | 3.7% |
| some college completed | 347 | 3.5% |
| post-secondary technical education | 193 | 1.9% |
| Other values (4) | 264 | 2.6% |
| Value | Count | Frequency (%) |
| completed | 7772 | |
| primary | 4670 | |
| some | 2592 | 7.9% |
| yrs | 2425 | 7.4% |
| secondary | 2275 | 6.9% |
| general | 2213 | 6.7% |
| 6 | 1791 | 5.4% |
| no | 1615 | 4.9% |
| schooling | 1615 | 4.9% |
| track | 1161 | 3.5% |
| Other values (13) | 4758 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 28514 | |
| 22887 | 9.6% | |
| o | 19944 | 8.4% |
| r | 19575 | 8.2% |
| m | 15080 | 6.3% |
| c | 14066 | 5.9% |
| l | 13663 | 5.7% |
| p | 12635 | 5.3% |
| a | 10960 | 4.6% |
| n | 10519 | 4.4% |
| Other values (19) | 70252 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 205785 | |
| Space Separator | 22887 | 9.6% |
| Open Punctuation | 2799 | 1.2% |
| Close Punctuation | 2799 | 1.2% |
| Decimal Number | 2425 | 1.0% |
| Other Punctuation | 1207 | 0.5% |
| Dash Punctuation | 193 | 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 28514 | |
| o | 19944 | 9.7% |
| r | 19575 | 9.5% |
| m | 15080 | 7.3% |
| c | 14066 | 6.8% |
| l | 13663 | 6.6% |
| p | 12635 | 6.1% |
| a | 10960 | 5.3% |
| n | 10519 | 5.1% |
| d | 10433 | 5.1% |
| Other values (10) | 50396 |
| Value | Count | Frequency (%) |
| 6 | 1791 | |
| 4 | 478 | 19.7% |
| 5 | 156 | 6.4% |
| Value | Count | Frequency (%) |
| , | 1161 | |
| / | 46 | 3.8% |
| Value | Count | Frequency (%) |
| 22887 |
| Value | Count | Frequency (%) |
| ( | 2799 |
| Value | Count | Frequency (%) |
| ) | 2799 |
| Value | Count | Frequency (%) |
| - | 193 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 205785 | |
| Common | 32310 | 13.6% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 28514 | |
| o | 19944 | 9.7% |
| r | 19575 | 9.5% |
| m | 15080 | 7.3% |
| c | 14066 | 6.8% |
| l | 13663 | 6.6% |
| p | 12635 | 6.1% |
| a | 10960 | 5.3% |
| n | 10519 | 5.1% |
| d | 10433 | 5.1% |
| Other values (10) | 50396 |
| Value | Count | Frequency (%) |
| 22887 | ||
| ( | 2799 | 8.7% |
| ) | 2799 | 8.7% |
| 6 | 1791 | 5.5% |
| , | 1161 | 3.6% |
| 4 | 478 | 1.5% |
| - | 193 | 0.6% |
| 5 | 156 | 0.5% |
| / | 46 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 238095 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 28514 | |
| 22887 | 9.6% | |
| o | 19944 | 8.4% |
| r | 19575 | 8.2% |
| m | 15080 | 6.3% |
| c | 14066 | 5.9% |
| l | 13663 | 5.7% |
| p | 12635 | 5.3% |
| a | 10960 | 4.6% |
| n | 10519 | 4.4% |
| Other values (19) | 70252 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
| inactive | |
|---|---|
| employed | |
| niu (not in universe) | |
| unemployed | 366 |
| unknown/missing | 39 |
Length
| Max length | 21 |
|---|---|
| Median length | 8 |
| Mean length | 10.4769 |
| Min length | 8 |
Characters and Unicode
| Total characters | 104769 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | employed |
|---|---|
| 2nd row | inactive |
| 3rd row | niu (not in universe) |
| 4th row | employed |
| 5th row | employed |
| Value | Count | Frequency (%) |
| inactive | 3982 | |
| employed | 3785 | |
| niu (not in universe) | 1828 | |
| unemployed | 366 | 3.7% |
| unknown/missing | 39 | 0.4% |
| Value | Count | Frequency (%) |
| inactive | 3982 | |
| employed | 3785 | |
| not | 1828 | |
| in | 1828 | |
| niu | 1828 | |
| universe | 1828 | |
| unemployed | 366 | 2.4% |
| unknown/missing | 39 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 15940 | |
| i | 13526 | |
| n | 11816 | |
| o | 6018 | 5.7% |
| t | 5810 | 5.5% |
| v | 5810 | 5.5% |
| 5484 | 5.2% | |
| m | 4190 | 4.0% |
| p | 4151 | 4.0% |
| l | 4151 | 4.0% |
| Other values (13) | 27873 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 95590 | |
| Space Separator | 5484 | 5.2% |
| Open Punctuation | 1828 | 1.7% |
| Close Punctuation | 1828 | 1.7% |
| Other Punctuation | 39 | < 0.1% |
Most frequent character per category
| Value | Count | Frequency (%) |
| e | 15940 | |
| i | 13526 | |
| n | 11816 | |
| o | 6018 | 6.3% |
| t | 5810 | 6.1% |
| v | 5810 | 6.1% |
| m | 4190 | 4.4% |
| p | 4151 | 4.3% |
| l | 4151 | 4.3% |
| y | 4151 | 4.3% |
| Other values (9) | 20027 |
| Value | Count | Frequency (%) |
| 5484 |
| Value | Count | Frequency (%) |
| ( | 1828 |
| Value | Count | Frequency (%) |
| ) | 1828 |
| Value | Count | Frequency (%) |
| / | 39 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 95590 | |
| Common | 9179 | 8.8% |
Most frequent character per script
| Value | Count | Frequency (%) |
| e | 15940 | |
| i | 13526 | |
| n | 11816 | |
| o | 6018 | 6.3% |
| t | 5810 | 6.1% |
| v | 5810 | 6.1% |
| m | 4190 | 4.4% |
| p | 4151 | 4.3% |
| l | 4151 | 4.3% |
| y | 4151 | 4.3% |
| Other values (9) | 20027 |
| Value | Count | Frequency (%) |
| 5484 | ||
| ( | 1828 | 19.9% |
| ) | 1828 | 19.9% |
| / | 39 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104769 |
Most frequent character per block
| Value | Count | Frequency (%) |
| e | 15940 | |
| i | 13526 | |
| n | 11816 | |
| o | 6018 | 5.7% |
| t | 5810 | 5.5% |
| v | 5810 | 5.5% |
| 5484 | 5.2% | |
| m | 4190 | 4.0% |
| p | 4151 | 4.0% |
| l | 4151 | 4.0% |
| Other values (13) | 27873 |
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 11.3 KiB |
| at work | |
|---|---|
| niu (not in universe) | |
| inactive (not in labor force) | |
| housework | |
| in school | |
| Other values (19) |
Length
| Max length | 42 |
|---|---|
| Median length | 9 |
| Mean length | 15.7419 |
| Min length | 7 |
Characters and Unicode
| Total characters | 157419 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | at work |
|---|---|
| 2nd row | in school |
| 3rd row | niu (not in universe) |
| 4th row | at work |
| 5th row | at work |
| Value | Count | Frequency (%) |
| at work | 3542 | |
| niu (not in universe) | 1828 | |
| inactive (not in labor force) | 1713 | |
| housework | 947 | 9.5% |
| in school | 819 | 8.2% |
| inactive, other reasons | 339 | 3.4% |
| unemployed, not specified | 277 | 2.8% |
| have job, not at work in reference period | 84 | 0.8% |
| employed, not specified | 79 | 0.8% |
| permanent disability | 64 | 0.6% |
| Other values (14) | 308 | 3.1% |
| Value | Count | Frequency (%) |
| in | 4444 | |
| not | 3986 | |
| work | 3669 | |
| at | 3647 | |
| inactive | 2055 | |
| universe | 1828 | |
| niu | 1828 | |
| force | 1713 | 5.8% |
| labor | 1713 | 5.8% |
| housework | 947 | 3.2% |
| Other values (36) | 3536 |
Most occurring characters
| Value | Count | Frequency (%) |
| 19366 | ||
| o | 16231 | |
| n | 15670 | 10.0% |
| i | 13686 | 8.7% |
| e | 12118 | 7.7% |
| r | 11434 | 7.3% |
| t | 10335 | 6.6% |
| a | 8269 | 5.3% |
| c | 5097 | 3.2% |
| u | 5050 | 3.2% |
| Other values (19) | 40163 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 129989 | |
| Space Separator | 19366 | 12.3% |
| Open Punctuation | 3541 | 2.2% |
| Close Punctuation | 3541 | 2.2% |
| Other Punctuation | 982 | 0.6% |
Most frequent character per category
| Value | Count | Frequency (%) |
| o | 16231 | |
| n | 15670 | |
| i | 13686 | |
| e | 12118 | |
| r | 11434 | |
| t | 10335 | 8.0% |
| a | 8269 | 6.4% |
| c | 5097 | 3.9% |
| u | 5050 | 3.9% |
| s | 4955 | 3.8% |
| Other values (14) | 27144 |
| Value | Count | Frequency (%) |
| , | 917 | |
| / | 65 | 6.6% |
| Value | Count | Frequency (%) |
| 19366 |
| Value | Count | Frequency (%) |
| ( | 3541 |
| Value | Count | Frequency (%) |
| ) | 3541 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 129989 | |
| Common | 27430 | 17.4% |
Most frequent character per script
| Value | Count | Frequency (%) |
| o | 16231 | |
| n | 15670 | |
| i | 13686 | |
| e | 12118 | |
| r | 11434 | |
| t | 10335 | 8.0% |
| a | 8269 | 6.4% |
| c | 5097 | 3.9% |
| u | 5050 | 3.9% |
| s | 4955 | 3.8% |
| Other values (14) | 27144 |
| Value | Count | Frequency (%) |
| 19366 | ||
| ( | 3541 | 12.9% |
| ) | 3541 | 12.9% |
| , | 917 | 3.3% |
| / | 65 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 157419 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 19366 | ||
| o | 16231 | |
| n | 15670 | 10.0% |
| i | 13686 | 8.7% |
| e | 12118 | 7.7% |
| r | 11434 | 7.3% |
| t | 10335 | 6.6% |
| a | 8269 | 5.3% |
| c | 5097 | 3.2% |
| u | 5050 | 3.2% |
| Other values (19) | 40163 |
labforce
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.1 KiB |
| yes, in the labor force | |
|---|---|
| no, not in the labor force | |
| niu (not in universe) | |
| unknown | 31 |
Length
| Max length | 26 |
|---|---|
| Median length | 23 |
| Mean length | 23.3002 |
| Min length | 7 |
Characters and Unicode
| Total characters | 233002 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | yes, in the labor force |
|---|---|
| 2nd row | niu (not in universe) |
| 3rd row | niu (not in universe) |
| 4th row | yes, in the labor force |
| 5th row | yes, in the labor force |
| Value | Count | Frequency (%) |
| yes, in the labor force | 4068 | |
| no, not in the labor force | 3060 | |
| niu (not in universe) | 2841 | |
| unknown | 31 | 0.3% |
| Value | Count | Frequency (%) |
| in | 9969 | |
| labor | 7128 | |
| the | 7128 | |
| force | 7128 | |
| not | 5901 | |
| yes | 4068 | |
| no | 3060 | 6.1% |
| niu | 2841 | 5.7% |
| universe | 2841 | 5.7% |
| unknown | 31 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 40095 | ||
| n | 24705 | |
| e | 24006 | |
| o | 23248 | |
| r | 17097 | 7.3% |
| i | 15651 | 6.7% |
| t | 13029 | 5.6% |
| , | 7128 | 3.1% |
| h | 7128 | 3.1% |
| l | 7128 | 3.1% |
| Other values (12) | 53787 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 180097 | |
| Space Separator | 40095 | 17.2% |
| Other Punctuation | 7128 | 3.1% |
| Open Punctuation | 2841 | 1.2% |
| Close Punctuation | 2841 | 1.2% |
Most frequent character per category
| Value | Count | Frequency (%) |
| n | 24705 | |
| e | 24006 | |
| o | 23248 | |
| r | 17097 | |
| i | 15651 | |
| t | 13029 | 7.2% |
| h | 7128 | 4.0% |
| l | 7128 | 4.0% |
| a | 7128 | 4.0% |
| b | 7128 | 4.0% |
| Other values (8) | 33849 |
| Value | Count | Frequency (%) |
| , | 7128 |
| Value | Count | Frequency (%) |
| 40095 |
| Value | Count | Frequency (%) |
| ( | 2841 |
| Value | Count | Frequency (%) |
| ) | 2841 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 180097 | |
| Common | 52905 | 22.7% |
Most frequent character per script
| Value | Count | Frequency (%) |
| n | 24705 | |
| e | 24006 | |
| o | 23248 | |
| r | 17097 | |
| i | 15651 | |
| t | 13029 | 7.2% |
| h | 7128 | 4.0% |
| l | 7128 | 4.0% |
| a | 7128 | 4.0% |
| b | 7128 | 4.0% |
| Other values (8) | 33849 |
| Value | Count | Frequency (%) |
| 40095 | ||
| , | 7128 | 13.5% |
| ( | 2841 | 5.4% |
| ) | 2841 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 233002 |
Most frequent character per block
| Value | Count | Frequency (%) |
| 40095 | ||
| n | 24705 | |
| e | 24006 | |
| o | 23248 | |
| r | 17097 | 7.3% |
| i | 15651 | 6.7% |
| t | 13029 | 5.6% |
| , | 7128 | 3.1% |
| h | 7128 | 3.1% |
| l | 7128 | 3.1% |
| Other values (12) | 53787 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| df_index | country | year | sample | serial | persons | hhwt | gq | geolev1 | internet | computer | pernum | perwt | age | sex | race | indig | lit | edattain | edattaind | empstat | empstatd | labforce | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 21899551 | brazil | 2010 | brazil 2010 | 5.311042e+09 | 2 | 2.35 | households | 76043 | niu (not in universe) | no | 1 | 2.35 | 25 | male | white | no | yes, literate | secondary completed | secondary, general track completed | employed | at work | yes, in the labor force |
| 1 | 38283085 | mexico | 2015 | mexico 2015 | 9.875930e+08 | 4 | 50.00 | households | 484014 | yes | no | 2 | 50.00 | 13 | female | NaN | no | yes, literate | primary completed | primary (6 yrs) completed | inactive | in school | niu (not in universe) |
| 2 | 7388087 | brazil | 2010 | brazil 2010 | 8.805020e+08 | 4 | 10.47 | households | 76023 | niu (not in universe) | no | 2 | 10.47 | 3 | female | white | no | niu (not in universe) | less than primary completed | no schooling | niu (not in universe) | niu (not in universe) | niu (not in universe) |
| 3 | 40775034 | mexico | 2015 | mexico 2015 | 1.632011e+09 | 5 | 2.00 | households | 484020 | no | yes | 5 | 2.00 | 42 | female | NaN | yes | yes, literate | primary completed | lower secondary general completed | employed | at work | yes, in the labor force |
| 4 | 1064103 | argentina | 2010 | argentina 2010 | 3.530430e+08 | 3 | 10.00 | households | 32006 | NaN | yes | 1 | 10.00 | 57 | male | NaN | NaN | yes, literate | primary completed | primary (6 yrs) completed | employed | at work | yes, in the labor force |
| 5 | 21924791 | brazil | 2010 | brazil 2010 | 5.319560e+09 | 3 | 3.86 | households | 76043 | yes | yes | 3 | 3.86 | 76 | female | white | no | yes, literate | primary completed | primary (6 yrs) completed | inactive | inactive (not in labor force) | no, not in the labor force |
| 6 | 51705152 | venezuela | 2001 | venezuela 2001 | 4.073540e+08 | 2 | 10.00 | households | 862015 | no | no | 2 | 10.00 | 30 | male | NaN | NaN | yes, literate | primary completed | lower secondary general completed | employed | at work | yes, in the labor force |
| 7 | 20333973 | brazil | 2010 | brazil 2010 | 4.811139e+09 | 2 | 3.05 | households | 76041 | niu (not in universe) | no | 2 | 3.05 | 70 | female | white | no | no, illiterate | less than primary completed | no schooling | inactive | inactive (not in labor force) | no, not in the labor force |
| 8 | 45447027 | mexico | 2015 | mexico 2015 | 2.851823e+09 | 13 | 6.00 | households | 484031 | no | no | 3 | 6.00 | 28 | female | NaN | yes | yes, literate | primary completed | primary (6 yrs) completed | inactive | housework | no, not in the labor force |
| 9 | 52338114 | venezuela | 2001 | venezuela 2001 | 5.835450e+08 | 7 | 10.00 | households | 862023 | no | no | 6 | 10.00 | 18 | male | NaN | NaN | yes, literate | secondary completed | secondary, general track completed | unemployed | unemployed, new worker | yes, in the labor force |
Last rows
| df_index | country | year | sample | serial | persons | hhwt | gq | geolev1 | internet | computer | pernum | perwt | age | sex | race | indig | lit | edattain | edattaind | empstat | empstatd | labforce | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | 8471333 | brazil | 2010 | brazil 2010 | 1.184290e+09 | 2 | 3.83 | households | 76025 | niu (not in universe) | no | 2 | 3.83 | 53 | female | brown (brazil) | no | no, illiterate | less than primary completed | some primary completed | inactive | inactive (not in labor force) | no, not in the labor force |
| 9991 | 7453645 | brazil | 2010 | brazil 2010 | 8.984330e+08 | 8 | 5.23 | households | 76023 | niu (not in universe) | no | 6 | 5.23 | 15 | female | brown (brazil) | no | yes, literate | primary completed | primary (6 yrs) completed | inactive | inactive (not in labor force) | no, not in the labor force |
| 9992 | 2248829 | argentina | 2010 | argentina 2010 | 7.217800e+08 | 8 | 10.00 | households | 32018 | NaN | no | 6 | 10.00 | 11 | female | NaN | NaN | yes, literate | less than primary completed | some primary completed | niu (not in universe) | niu (not in universe) | niu (not in universe) |
| 9993 | 5912698 | brazil | 2010 | brazil 2010 | 4.900580e+08 | 4 | 8.72 | households | 76021 | niu (not in universe) | no | 3 | 8.72 | 8 | male | white | no | yes, literate | less than primary completed | some primary completed | niu (not in universe) | niu (not in universe) | niu (not in universe) |
| 9994 | 42426758 | mexico | 2015 | mexico 2015 | 2.051680e+09 | 1 | 10.00 | households | 484021 | no | no | 1 | 10.00 | 60 | male | NaN | no | yes, literate | university completed | university completed | inactive | retirees and living on rent | no, not in the labor force |
| 9995 | 26255366 | colombia | 2005 | colombia 2005 | 3.979800e+07 | 9 | 4.64 | households | 170005 | NaN | no | 4 | 4.64 | 21 | male | white | no | yes, literate | less than primary completed | some primary completed | employed | at work | yes, in the labor force |
| 9996 | 4184185 | brazil | 2010 | brazil 2010 | 6.417800e+07 | 5 | 8.83 | households | 76012 | niu (not in universe) | no | 3 | 8.83 | 91 | female | white | no | no, illiterate | less than primary completed | no schooling | inactive | inactive (not in labor force) | no, not in the labor force |
| 9997 | 30656672 | dominican republic | 2010 | dominican republic 2010 | 3.642400e+07 | 2 | 10.00 | households | 214018 | no | yes | 1 | 10.00 | 28 | male | NaN | NaN | yes, literate | primary completed | primary (6 yrs) completed | employed | employed, not specified | yes, in the labor force |
| 9998 | 9051171 | brazil | 2010 | brazil 2010 | 1.349303e+09 | 6 | 5.47 | households | 76026 | niu (not in universe) | no | 2 | 5.47 | 45 | female | white | no | no, illiterate | less than primary completed | some primary completed | inactive | inactive (not in labor force) | no, not in the labor force |
| 9999 | 46094153 | nicaragua | 2005 | nicaragua 2005 | 7.503100e+07 | 5 | 10.00 | households | 558055 | no | no | 1 | 10.00 | 44 | female | NaN | no | yes, literate | primary completed | lower secondary general completed | employed | at work | yes, in the labor force |